A critical look at software tools in corpus linguistics

Author

  • Laurence Anthony
Abstract

Anthony, Laurence. 2013. A critical look at software tools in corpus linguistics. Linguistic Research 30(2), 141-161. Corpora are often referred to as the ‘tools’ of corpus linguistics. However, it is important to recognize that corpora are simply linguistic data and that specialized software tools are required to view and analyze them. The functionality offered by software tools largely dictates what corpus linguistics research methods are available to a researcher, and hence, the design of tools will become an increasingly important factor as corpora become larger and the statistical analysis of linguistic data becomes increasingly complex. In this paper, I will first discuss how separating the data from the tools resolves various issues that are hotly debated within the field. Next, I will offer a critical look at the development of four generations of corpus tools, discussing their strengths and weaknesses. Then, I will discuss the role of programming in corpus linguistics tools creation and present a model for the development of future corpus tools. Finally, I will show a real-world example of a next-generation corpus tool that was developed for use in language learning. (Waseda University)

Similar resources

A Corpus-Based Study of the Lexical Make-up of Applied Linguistics Article Abstracts

This paper reports results from a corpus-based study that explored the frequency of words in the abstracts of applied linguistics journal articles. The abstracts of major articles in leading applied linguistics journals, published since 2005 up to November 2001, were analyzed using software modules from the Compleat Lexical Tutor. The output includes a list of the most frequent content words, list...

Increasing Interoperability for Embedding Corpus Annotation Pipelines in Wmatrix and other corpus retrieval tools

Computational tools and methods employed in corpus linguistics are split into three main types: compilation, annotation and retrieval. These mirror and support the usual corpus linguistics methodology of corpus collection, manual and/or automatic tagging, followed by query and analysis. Typically, corpus software to support retrieval implements some or all of the five major methods in corpus li...

Corpora and English Teaching: Retrospect and Prospect

As a whole system of methods and principles of how to apply corpora in language study, corpus linguistics has revolutionized nearly all branches of linguistics. In the wake of this revolution, people began to rethink language pedagogy from a corpus perspective in the early 1990s. Today, however, although corpus linguistics has contributed much to English education, difficulties do exist, especially i...

Sources of variability relevant to the cognitive sociolinguist, and corpus- as well as psycholinguistic methods and notions to handle them

This paper is a plea for sociolinguistics to integrate both theoretical and methodological developments from cognitive linguistics and, even more importantly, psycholinguistics. More specifically, I argue that theoretical advances involving exemplar-based models and new methodological tools from psycholinguistics (regressions, in particular mixed-effects models) and corpus linguistics (i...

The Vocabulary Profile of Iranian English Teaching School books

This paper provides a fairly detailed corpus-based vocabulary profile of the Iranian EFL books used in public schools. To this end, the WordPerfect files of all the seven books were converted to text format to get rid of the formatting features and be compatible with the software used for analysis. The software tools used were the Compleat Lexical Tutor suite, version 6.2 (Cobb, 2011), AntConc ...

Publication date: 2013